SEPT: Towards Scalable and Efficient Visual Pre-training
نویسندگان
چکیده
Recently, the self-supervised pre-training paradigm has shown great potential in leveraging large-scale unlabeled data to improve downstream task performance. However, increasing scale of real-world scenarios requires prohibitive computational costs and faces challenge uncurated samples. To address these issues, we build a task-specific framework from selection perspective based on simple hypothesis that samples with similar distribution target can bring substantial performance gains. Buttressed by hypothesis, propose first yet novel for Scalable Efficient visual Pre-Training (SEPT) introducing retrieval pipeline selection. SEPT leverage pre-trained model extract features entire dataset initialization. Then, specific task, retrievals most feature similarity each instance pre-training. Finally, pre-trains selected manner finetuning. By decoupling available upstream achieves high scalability efficiency pre-training, resulting architecture flexibility. Results various tasks demonstrate achieve competitive or even better compared ImageNet while reducing size training one magnitude without resorting any extra annotations.
منابع مشابه
Towards Scalable and Efficient Embedded Sensing Networks
Over the past decade, many embedded sensing networks have been deployed to pervasively monitor the physical world at previously infeasible or impractical spatial and temporal resolutions. The result of this novel sensing ability is deeper understanding and higher degree of control of the physical world. The decreasing cost and size of the sensor nodes will enable even larger scales of embedded ...
متن کاملTowards Efficient and Scalable Omniscient Debugging for Model Transformations
Model transformations (MTs) are central artifacts in modeldriven engineering (MDE) that define core operations on models. Like other software artifacts, MTs may possess defects (bugs). Some MDE tools provide support for debugging. In this paper, we describe an omniscient debugging technique. Our technique enhances stepwise execution support for MTs by providing the ability to traverse, in eithe...
متن کاملTowards efficient and scalable mediation: the AURORA approach
We develop a 2-tier, plug-and-play mediation model for accessing a large number of heterogeneous data sources. This model deenes a divide-and-conquer approach towards information integration. It is more suitable for applications such as electronic commerce than existing models. We also develop algebras that manipulate heterogeneous data, the mediation enabling algebras, that provide new techniq...
متن کاملTowards scalable and data efficient learning of Markov boundaries
We propose algorithms for learning Markov boundaries from data without having to learn a Bayesian network first. We study their correctness, scalability and data efficiency. The last two properties are important because we aim to apply the algorithms to identify the minimal set of features that is needed for probabilistic classification in databases with thousands of features but few instances,...
متن کاملA Diphone Sharing Method Towards Scalable Unit-training-based TTS
One of the most popular applications of Text to Speech (TTS) is in embedded devices. The resource limitation of embedded device requires the footprint of TTS system to be very small. Toshiba TTS for embedded device is a unit-training-based system and uses diphone as basic unit. The trained diphone inventory occupies a large part of the footprint. This paper proposes a diphone sharing method to ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i2.25249